Goto

Collaborating Authors

 literature survey


Graphy'our Data: Towards End-to-End Modeling, Exploring and Generating Report from Raw Data

Lai, Longbin, Luo, Changwei, Lou, Yunkai, Ju, Mingchen, Yang, Zhengyi

arXiv.org Artificial Intelligence

Large Language Models (LLMs) have recently demonstrated remarkable performance in tasks such as Retrieval-Augmented Generation (RAG) and autonomous AI agent workflows. Yet, when faced with large sets of unstructured documents requiring progressive exploration, analysis, and synthesis, such as conducting literature survey, existing approaches often fall short. We address this challenge -- termed Progressive Document Investigation -- by introducing Graphy, an end-to-end platform that automates data modeling, exploration and high-quality report generation in a user-friendly manner. Graphy comprises an offline Scrapper that transforms raw documents into a structured graph of Fact and Dimension nodes, and an online Surveyor that enables iterative exploration and LLM-driven report generation. We showcase a pre-scrapped graph of over 50,000 papers -- complete with their references -- demonstrating how Graphy facilitates the literature-survey scenario. The demonstration video can be found at https://youtu.be/uM4nzkAdGlM.


Instruct Large Language Models to Generate Scientific Literature Survey Step by Step

Lai, Yuxuan, Wu, Yupeng, Wang, Yidan, Hu, Wenpeng, Zheng, Chen

arXiv.org Artificial Intelligence

Abstract. Automatically generating scientific literature surveys is a valuable task that can significantly enhance research efficiency. However, the diverse and complex nature of information within a literature survey poses substantial challenges for generative models. In this paper, we design a series of prompts to systematically leverage large language models (LLMs), enabling the creation of comprehensive literature surveys through a step-by-step approach. Specifically, we design prompts to guide LLMs to sequentially generate the title, abstract, hierarchical headings, and the main content of the literature survey. We argue that this design enables the generation of the headings from a high-level perspective. During the content generation process, this design effectively harnesses relevant information while minimizing costs by restricting the length of both input and output content in LLM queries. Our implementation with Qwen-long achieved third place in the NLPCC 2024 Scientific Literature Survey Generation evaluation task, with an overall score only 0.03% lower than the second-place team. Additionally, our soft heading recall is 95.84%, the second best among the submissions. Thanks to the efficient prompt design and the low cost of the Qwen-long API, our method reduces the expense for generating each literature survey to 0.1 RMB, enhancing the practical value of our method.


An experiment on an automated literature survey of data-driven speech enhancement methods

Santos, Arthur dos, Pereira, Jayr, Nogueira, Rodrigo, Masiero, Bruno, Sander-Tavallaey, Shiva, Zea, Elias

arXiv.org Artificial Intelligence

The increasing number of scientific publications in acoustics, in general, presents difficulties in conducting traditional literature surveys. This work explores the use of a generative pre-trained transformer (GPT) model to automate a literature survey of 116 articles on data-driven speech enhancement methods. The main objective is to evaluate the capabilities and limitations of the model in providing accurate responses to specific queries about the papers selected from a reference human-based survey. While we see great potential to automate literature surveys in acoustics, improvements are needed to address technical questions more clearly and accurately.


Towards Resilient Artificial Intelligence: Survey and Research Issues

Eigner, Oliver, Eresheim, Sebastian, Kieseberg, Peter, Klausner, Lukas Daniel, Pirker, Martin, Priebe, Torsten, Tjoa, Simon, Marulli, Fiammetta, Mercaldo, Francesco

arXiv.org Artificial Intelligence

Artificial intelligence (AI) systems are becoming critical components of today's IT landscapes. Their resilience against attacks and other environmental influences needs to be ensured just like for other IT assets. Considering the particular nature of AI, and machine learning (ML) in particular, this paper provides an overview of the emerging field of resilient AI and presents research issues the authors identify as potential future work.